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INTERNATIONAL BUSINESS MACHINES CORPORATION 



ADAPTIVE ENCODING OF A SEQUENCE OF STILL FRAMES OR 
PARTIALLY STILL FRAMES WITHIN MOTION VIDEO 

Technical Field 

This invention relates in general to compression 
5 of digital visual images, and more particularly, to a 
technique for encoding a series of still frames 
within a moving video sequence using image statistics 
derived from the video sequence to dynamically change 
one or more controllable encoding parameter (s) used 
10 in encoding the series of still frames. 

Background of the Invention 

Within the past decade, the advent of world-wide 
electronic communications systems has enhanced the 
way in which people can send and receive information. 

15 In particular, the capabilities of real-time video 
and audio systems have greatly improved in recent 
years. In order to provide services such as video- 
on-demand and video conferencing to subscribers, an 
enormous amount of network bandwidth is required. In 

2 0 fact, network bandwidth is often the main inhibitor 
in the effectiveness of such systems. 

In order to overcome the constraints imposed by 
networks, compression systems have emerged. These 
systems reduce the amount of video and audio data 
25 which must be transmitted by removing redundancy in 
the picture sequence. At the receiving end, the 
picture sequence is uncompressed and may be displayed 
in real-time. 
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One example of a video compression standard is 
the Moving Picture Experts Group ("MPEG") standard. 
Within the MPEG standard, video compression is 
defined both within a given picture and between 
5 pictures. Video compression within a picture is 

accomplished by conversion of the digital image from 
the time domain to the frequency domain by a discrete 
cosine transform, quantization, and variable length 
coding. Video compression between pictures is 
10 accomplished via a process referred to as motion 

estimation and compensation, in which a motion vector 
plus difference data is used to describe the 
translation of a set of picture elements (pels) from 
one picture to another. 



15 The ISO MPEG- 2 standard specifies only the 

syntax of bitstream and semantics of the decoding 
process. The choice of coding parameters and trade- 
offs in performance versus complexity are left to the 
encoder developers. 



20 One aspect of the encoding process is 

compressing a digital video image into as small a 
bitstream as possible while still maintaining video 
detail and quality. The MPEG standard places 
limitations on the size of the bitstream, and 

25 requires that the encoder be able to perform the 

encoding process. Thus, simply optimizing the bit 
rate to maintain desired picture quality and detail 
can be difficult. 



The MPEG- 2 standard is designed for motion 
30 video. Many coding tools and options are defined in 
the standard to achieve high quality pictures at low 
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bit rates. One significant feature of video 
compression in MPEG-2 is adaptive quantization, 
meaning that the quantization level is adjustable 
from one picture to the next and from one macroblock 
5 to the next within a picture. This flexibility 
allows an encoder to balance the output bitstream 
size and thereby achieve a constant bit rate output. 
Variation in quantization level also allows each 
compressed picture to have a different amount of 
10 encode bits based on complexity of intra and inter- 
picture characteristics . 

When the input video stream is constant, i.e., 
comprises a series of still frames, slight variations 
in the amount of detail in a macroblock of a current 

15 frame compared to the same macroblock of a prior 

frame or a next frame in the series of still frames 
can create fluctuation in luminance and/or 
chrominance data which can appear as movement between 
the frames notwithstanding that the frames actually 

20 comprise a series of still frames. For instance, 

variation in chrominance data from a prior frame to a 
current frame can create differences in shade of the 
same color. These differences in color shade 
effectively create pulsation artifacts which can 

25 cause a series of still pictures to come alive and no 
longer resemble the original input video. 

This invention thus seeks to enhance picture 
quality of an encoded video sequence having a series 
of still frames or partially still frames to enhance 
3 0 the encoding of the frames and prevent pulsation 
artifacts from, for example, still frame to still 
frame . 
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Disclosure of the Invention 



Briefly summarized, the invention comprises in 
one aspect a method for encoding a sequence of video 
frames which includes for each frame of the sequence: 
5 determining if the frame comprises a still frame; 
encoding the frame employing at least one 
controllable parameter; and adapting encoding of the 
frame if the determining determines that the frame is 
a still frame, wherein the adapting includes 

10 adjusting the at least one controllable parameter 

employed in encoding the frame to minimize pulsation 
artifacts between still frames of a series of still 
frames within the sequence of video frames, and 
wherein the still frame comprises one still frame of 

15 the series of still frames. 



In a further aspect, the invention comprises a 
method for encoding a frame of a sequence of video 
frames. The frame has a plurality of macroblocks and 
the method includes for at least some macroblocks of 

2 0 the frame: encoding the macroblock employing at least 
one controllable parameter; and adapting the encoding 
of the macroblock when the macroblock is a still 
macroblock, the adapting including adjusting the at 
least one controllable parameter employed in encoding 

25 the still macroblock to minimize pulsation artifacts 
between corresponding still macroblocks of adjacent 
frames in the sequence of video frames . 



In another aspect, the invention comprises a 
system for encoding a sequence of video frames. The 
3 0 system includes a pre-encode processing unit and an 
encoding engine. The pre-encode processing unit 
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includes a statistics measurement unit for use in 
determining if a current frame of the sequence of 
frames comprises a still frame, and a control unit 
for modifying at least one controllable parameter 
5 when the current frame is determined to comprise a 

still frame. The at least one controllable parameter 
is used by the encoding engine to encode the current 
frame of the sequence of video frames . 

In still another embodiment, the invention 
10 comprises a computer program product having computer 
usable medium with computer readable program code 
means therein for use in encoding a sequence of video 
frames. The computer readable program code means in 
the computer program product includes computer 
15 readable program code means for causing a computer to 
affect for each frame of the sequence of video 
frames: determining if the frame comprises a still 
frame; encoding the frame employing at least one 
controllable encode parameter; and adapting the 

2 0 encoding of the frame if the frame is determined to 

be a still frame, the adapting including adjusting 
the at least one controllable parameter employed in 
encoding the still frame to prevent pulsation 
artifacts between still frames of a sequence of still 
25 frames within the sequence of video frames, wherein 
the still frame comprises one still frame of the 
sequence of still frames. 

In general, encoding in accordance with the 
principles of the present invention minimizes or even 

3 0 eliminates perceptible picture pulsation in a 

sequence of MPEG encoded still frames or partial 
still frames. The encoding technique of this 
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invention can ensure constant picture quality for a 
series of still pictures and partially still 
pictures. Further, bits used in encoding a still 
picture or partially still picture are conserved by 
5 encouraging the use of skip macroblocks during the 

encode process. Adaptive encoding in accordance with 
this invention is performed by a pre-encode 
statistics gathering process and includes 
programmable still picture detection features. 

10 Brief Description of the Drawings 

The above -described objects, advantages and 
features of the present invention, as well as others, 
will be more readily understood from the following 
detailed description of certain preferred embodiments 
15 of the invention, when considered in conjunction with 
the accompanying drawings in which: 

Fig. 1 shows a flow diagram of a generalized 
MPEG-2 compliant encoder 11, including a discrete 
cosine transformer 21, a quantizer 23, a variable 
20 length coder 25, an inverse quantizer 29, an inverse 
discrete cosine transformer 31, motion compensation 
41, frame memory 42, and motion estimation 43. The 
data paths include the i m picture input 111, 
difference data 112, motion vectors 113 (to motion 

2 5 compensation 41 and to variable length coder 25) , the 

picture output 121, the feedback picture for motion 
estimation and compensation 131, and the motion 
compensated picture 101. This figure has the 
assumptions that the picture exists in frame 

3 0 memory or frame store 42 and that the i + l* is being 

encoded with motion estimation. 
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Fig. 2 illustrates the I, P, and B pictures, 
examples of their display and transmission orders, 
and forward, and backward motion prediction. 

Fig. 3 illustrates the search from the motion 
5 estimation block in the current frame or picture to 
the best matching block in a subsequent or previous 
frame or picture. Elements 211 and 211 1 represent 
the same location in both pictures. 

Fig. 4 illustrates the movement of blocks in 
10 accordance with the motion vectors from their 

position in a previous picture to a new picture, and 
the previous picture's blocks adjusted after using 
motion vectors. 

Fig. 5 shows a generalized encode system 300 in 
15 accordance with the present invention. System 30 0 
includes pre-encode statistics analysis 310 to 
determine whether an input picture comprises a still 
picture and based thereon whether one or more 
encoding parameters should be varied for the picture. 
20 Modified encoding parameters are forwarded to the 
encode engine 320 for use in encoding the input 
picture, thereby allowing adaptive encoding of a 
series of still frames within a sequence of motion 
video frames . 

25 Figs. 6a & 6b illustrate positioning of 

luminance and chrominance signals in 4:2:2 frame 
format and 4:2:0 frame format, respectively. 

Figs. 7a & 7b illustrate movement of an object 
380 for which a framewide summation statistic in 
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accordance with this invention would erroneously 
indicate that the frame 370' of Fig. 7b is a still 
frame, while use of a multi-region (372) statistical 
analysis correctly signals frame 370' of Fig. 7b to 
5 contain motion relative to frame 370 of Fig. 7a. 



Fig. 8 is a flow diagram of one embodiment for 
adaptive encoding of a still frame in accordance with 
the principles of the present invention. 

Fig. 9 is a flowchart of one embodiment for 
10 adaptively encoding a partially still frame in 
accordance with the principles of the present 
invention . 



Best Mode for Carrying Out the Invention 



The invention relates, for example, to MPEG 

15 compliant encoders and encoding processes such as 

described in "Information Technology-Generic coding 
of moving pictures and associated audio information: 
Video," Recommendation ITU-T H.262, ISO/IEC 13818-2, 
Draft International Standard, 1994. The encoding 

2 0 functions performed by the encoder include data 
input, spatial compression, motion estimation, 
macroblock type generation, data reconstruction, 
entropy coding, and data output. Spatial compression 
includes discrete cosine transformation (DCT) , 

25 quantization, and entropy encoding. Temporal 
compression includes intensive reconstructive 
processing, such as inverse discrete cosine 
transformation, inverse quantization, and motion 
compensation. Motion estimation and compensation are 

30 used for temporal compression functions. Spatial and 
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temporal compression are repetitive functions with 
high computational requirements. 

More particularly the invention relates, for 
example, to a process for performing spatial and 
5 temporal compression including discrete cosine 
transformation, quantization, entropy encoding, 
motion estimation, motion compensation, and 
prediction, and even more particularly to a system 
for accomplishing spatial and temporal compression. 

10 The first compression step is the elimination of 

spatial redundancy, for example, the elimination of 
spatial redundancy in a still picture of an "I" frame 
picture. Spatial redundancy is the redundancy within 
a picture. The MPEG-2 Draft Standard is using a 

15 block based method of reducing spatial redundancy. 
The method of choice is the discrete cosine 
transformation, and discrete cosine transform coding 
of the picture. Discrete cosine transform coding is 
combined with weighted scalar quantization and run 

20 length coding to achieve desirable compression. 

The discrete cosine transformation is an 
orthogonal transformation . Orthogonal 
transformations, because they have a frequency domain 
interpretation, are filter bank oriented. The 
25 discrete cosine transformation is also localized. 
That is, the encoding process samples on an 8x8 
spatial window which is sufficient to compute 64 
transform coefficients or sub-bands. 

Another advantage of the discrete cosine 
3 0 transformation is that fast encoding and decoding 
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algorithms are available. Additionally, the sub-band 
decomposition of the discrete cosine transformation 
is sufficiently well behaved to allow effective use 
of psychovisual criteria. 

5 After transformation, many of the frequency 

coefficients are zero, especially the coefficients 
for high spatial frequencies. These coefficients are 
organized into a zig-zag or alternate-scanned 
pattern, and converted into run-amplitude (run-level) 
10 pairs. Each pair indicates the number of zero 
coefficients and the amplitude of the non-zero 
coefficient. This is coded in a variable length 
code . 

Motion compensation is used to reduce or even 
15 eliminate redundancy between pictures. Motion 

compensation exploits temporal redundancy by dividing 
the current picture into blocks, for example, 
macroblocks, and then searching in previously 
transmitted pictures for a nearby block with similar 

2 0 content. Only the difference between the current 

block pels and the predicted block pels extracted 
from the reference picture is actually compressed for 
transmission and thereafter transmitted. 

The simplest method of motion compensation and 
25 prediction is to record the luminance and 

chrominance, i.e., intensity and color, of every 
pixel in an "I" picture, then record changes of 
luminance and chrominance, i.e., intensity and color 
for every specific pixel in the subsequent picture. 

3 0 However, this is uneconomical in transmission medium 

bandwidth, memory, processor capacity, and processing 
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time because objects move between pictures, that is, 
pixel contents move from one location in one picture 
to a different location in a subsequent picture. A 
more advanced idea is to use a previous or subsequent 
5 picture to predict where a block of pixels will be in 
a subsequent or previous picture or pictures, for 
example, with motion vectors, and to write the result 
as "predicted pictures" or "P" pictures. More 
particularly, this involves making a best estimate or 

10 prediction of where the pixels or macroblocks of 
pixels of the 1 th picture will be in the i-l* or i + 
picture. It is one step further to use both 
subsequent and previous pictures to predict where a 
block of pixels will be in an intermediate or "B" 

15 picture. 

To be noted is that the picture encoding order 
and the picture transmission order do not necessarily 
match the picture display order. See Fig. 2. For I- 
P-B systems the input picture transmission order is 
2 0 different from the encoding order, and the input 

pictures must be temporarily stored until used for 
encoding. A buffer stores this input until it is 
used. 

For purposes of illustration, a generalized 
25 flowchart of MPEG compliant encoding is shown in Fig. 
1. In the flowchart the images of the i* picture and 
the i+1* picture are processed to generate motion 
vectors. The motion vectors predict where a 
macroblock of pixels will be in a prior and/or 
30 subsequent picture. The use of the motion vectors is 
a key aspect of temporal compression in the MPEG 
standard. As shown in Fig. 1 the motion vectors, 
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once generated, are used for the translation of the 
macroblocks of pixels, from the i* picture to the i + l* 
picture . 



As shown in Fig. 1, in the encoding process, the 
5 images of the i* picture and the i + l* picture are 
processed in the encoder 11 to generate motion 
vectors which are the form in which, for example, the 
i + l th and subsequent pictures are encoded and 
transmitted. An input image 111 of a subsequent 

10 picture goes to the motion estimation unit 43 of the 
encoder. Motion vectors 113 are formed as the output 
of the motion estimation unit 43. These vectors are 
used by the motion compensation Unit 41 to retrieve 
macroblock data from previous and/or future pictures, 

15 referred to as "reference" data, for output by this 
unit. One output of the motion compensation Unit 41 
is negatively summed with the output from the motion 
estimation unit 43 and goes to the input of the 
Discrete Cosine Transformer 21. The output of the 

20 discrete cosine transformer 21 is quantized in a 

quantizer 23 . The output of the quantizer 23 is split 
into two outputs, 121 and 131; one output 121 goes 
to a downstream element 25 for further compression 
and processing before transmission, such as to a run 

2 5 length encoder; the other output 131 goes through 

reconstruction of the encoded macroblock of pixels 
for storage in frame memory 42 . In the encoder shown 
for purposes of illustration, this second output 131 
goes through an inverse quantization 2 9 and an 

3 0 inverse discrete cosine transform 31 to return a 

lossy version of the difference macroblock. This data 
is summed with the output of the motion compensation 
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unit 41 and returns a lossy version of the original 
picture to the frame memory 42 . 

As shown in Fig. 2, there are three types of 
pictures. There are "Intra pictures" or "I" pictures 
5 which are encoded and transmitted whole, and do not 
require motion vectors to be defined. These "I" 
pictures serve as a reference image for motion 
estimation. There are "Predicted pictures" or "P" 
pictures which are formed by motion vectors from a 

10 previous picture and can serve as a reference image 
for motion estimation for further pictures. Finally, 
there are "Bidirectional pictures" or "B" pictures 
which are formed using motion vectors from two other 
pictures, one past and one future, and can not serve 

15 as a reference image for motion estimation. Motion 
vectors are generated from "I" and "P" pictures, and 
are used to form "P" and "B" pictures. 

One method by which motion estimation is carried 
out, shown in Fig. 3, is by a search from a 

20 macroblock 211 of an i* picture throughout a region 

of the next picture to find the best match macroblock 
213. Translating the macroblocks in this way yields 
a pattern of macroblocks for the i + l* picture, as 
shown in Fig. 4. In this way the i* 1 picture is 

25 changed a small amount, e.g., by motion vectors and 
difference data, to generate the i + l* picture. What 
is encoded are the motion vectors and difference 
data, and not the i+l^ picture itself. Motion vectors 
translate position of an image from picture to 

30 picture, while difference data carries changes in 
chrominance, luminance, and saturation, that is, 
changes in shading and illumination. 
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Returning to Fig. 3, we look for a good match by 
starting from the same location in the picture as 
in the i+l* picture. A search window is created in 
the 1 th picture. We search for a best match within 
5 this search window. Once found, the best match motion 
vectors for the macroblock are coded. The coding of 
the best match macroblock includes a motion vector, 
that is, how many pixels in the y direction and how 
many pixels in the x direction is the best match 
10 displaced in the next picture. Also encoded is 

difference data, also referred to as the "prediction 
error 1 ', which is the difference in chrominance and 
luminance between the current macroblock and the best 
match reference macroblock. 

15 The operational functions of an MPEG-2 encoder 

are discussed in detail in commonly assigned, co- 
pending United States Patent Application Serial No. 
08/831,157, by Carr et al . , filed April 1, 1997, 
entitled "Control Scheme For Shared-Use Dual-Port 

20 Predicted Error Array, " which is hereby incorporated 
herein by reference in its entirety. 

As noted initially, encoder performance and 
picture quality are often enhanced today through the 
use of adaptive quantization. Examples of adaptive 

25 quantization are presented in co-pending, commonly 
assigned United States Patent Applications by 
Boroczky et al., entitled "Adaptive Real-Time 
Encoding of Video Sequence Employing Image 
Statistics," filed October 10, 1997, serial no. 

30 08/948,442, and by Boice et al . , entitled "Real-Time 
Variable Bit Rate Encoding of Video Sequence 
Employing Image Statistics," filed January 16, 1998, 
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serial no. 09/008,282, both of which are hereby 
incorporated herein by reference in their entirety. 

Adaptive quantization can be used to control the 
amount of data generated so that an average amount of 
5 data is output by the encoder and so that this 
average will match a specified bit rate, thereby 
achieving an overall constant bit rate. Since this 
compression has adaptive capabilities, there is a 
possibility that macroblocks in a still frame can 

10 change slightly from corresponding macroblocks in a 
prior or reference still frame. This is partially 
due to the type of motion estimation employed. For 
example, hierarchical motion estimation is often used 
for MPEG encoding in order to save circuitry, enhance 

15 performance and reduce memory bandwidth. 

Hierarchical motion estimation, however, does not 
rely on a full motion estimation search pixel by 
pixel. Rather, a scaled down picture size is 
employed by grouping pixels together (for example, 

20 four pixels) into a single modified pixel value, 
which results in loss of accuracy when motion 
estimation is performed. This inaccuracy can create 
small errors, which in the case of motion video is 
acceptable because the difference data (i.e., the 

25 prediction error) is forwarded to the decoder for use 
in recreating the original picture. However, in the 
case of still pictures, these inaccuracies are 
manifested as pulsation artifacts due to differences 
in shading and color between adjacent still frames, 

3 0 which cause the still picture to appear to be moving. 

Based on this observation, applicants have 
conceived of the present invention to enhance MPEG 
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encoding of a still or partially still picture 
sequence and minimize or even eliminate pulsation 
artifacts between frames in a sequence of still 
video . 

5 Two considerations are important to preventing 

pulsation between still pictures. One requirement is 
that the same coding type be employed from still 
picture to still picture, and the second is that a 
comparable level of quantization be maintained for 
10 the same macroblock from still picture to still 

picture. However, these coding options can only be 
applied when a picture is first determined to be a 
still picture. 

When successive frames of a sequence of still 
15 frames are the same, both the luminance and 

chrominance data (or intensity) of each pair of still 
pictures in the sequence should be the same, with the 
exception of a small amount of noise. In addition, 
variance of the value of the same pixel across two 
20 adjacent still pictures should be zero, or a small 

value if noise is present. This information is used 
in accordance with the present invention to 
accumulate in a picture preprocessing stage image 
statistics to determine whether the current picture 
25 is a still picture within a sequence of still 
pictures . 

Fig. 5 depicts one embodiment of an encode 
system, generally denoted 300, in accordance with 
this invention. System 300 includes a pre-encode 
3 0 stage 310 where statistics gathering and analysis are 
performed on each frame of a sequence of motion video 
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frames to determine whether the current frame is a 
still frame relative to the preceding frame of the 
sequence. As shown, an input sequence of video 
frames is buffered in frame memory 330, while 
5 preprocessing of the input stream by statistics 

gathering and analysis 310 is performed. Controller 
34 0 determines where a given input picture should be 
placed within frame memory 330, as well as when to 
encode the picture. 

10 Preprocessing 310 gathers statistics indicative 

of whether the current picture comprises a still 
picture 312 and places them into a buffer 314. 
Stacking of input picture statistics is needed 
because the GOP structure employed in MPEG encoding 

15 of a sequence of video frames may have to be 

reordered prior to encoding. Pursuant to controller 
34 0 microcode, when a given frame is to be encoded 
preprocessing 310 determines whether one or more 
encoding parameters should be adjusted 316 for the 

2 0 frame based on whether the current frame comprises a 

still frame. As described further below, adjustable 
parameters may include a motion vector mode setting, 
a macroblock quantization level, and a target bitrate 
for encoding the frame. This information is 
25 forwarded to the encoder engine 32 0 commensurate with 
retrieval of the input data to be compressed from 
frame memory 330. Unless otherwise stated herein, 
encode engine 320 can comprise conventional MPEG 
compression processing as summarized initially 

3 0 herein. 

By way of example, statistics analysis 310 
determines whether the current frame is stationary, 



EN998027 



-17- 



i.e., comprises a still frame, by determining for 
example a summation of all pixel data in the current 
frame, which is referred to herein as a PIX-SUM, and 
by determining an accumulated absolute value of a 
5 difference in pixel data between adjacent pixels in 
the current frame. This comparison is referred to as 
a "PIX-DIFF". 



Figs. 6a & 6b represent positioning of luminance 
and chrominance samples for 4:2:2 and 4:2:0 picture 

10 data, respectively. As noted, two statistics are 
preferably gathered in accordance with this 
invention, namely, PIX-SUM and PIX-DIFF. The 
equations for PIX-SUM can vary with the data format, 
while the equation for PIX-DIFF is the same for both 

15 formats. For a 4:2:2 picture format, an equation for 
PIX-SUM is: 



PIX-SUM = Max Max 

2£ £ L x , y 

x=l y=l 

20 Max Max 

+ E £ Cb 

x , y 

x=l y=l,3,5... 

Max Max 

+ £ £ cr x<y 

x=l 7=1,3,5... 



25 



Where: x is the row number from 1 to the 

maximum number of lines in the picture 
(e.g. , 480) ; and 



y is the column number from 1 to the 
30 maximum number of columns in the picture 

(e.g., 720) . 
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PIX-DIFF is determined by imagining the 
luminance data lines of the picture concatenated to 
form a long line of luminance samples. Then for a 
given picture, the equation for PIX-DIFF statistics 
is : 



PIX-DIFF 



Max 



The PIX-DIFF statistic essentially comprises finding 
10 the difference between two adjacent luminance pixels 
in this concatenated string of luminance data for the 
frame and then summing the absolute values of those 
differences. As an alternative, PIX-DIFF could be 
defined as an accumulation of both luminance and 
15 chrominance data for the current frame, or an 
accumulation of chrominance data only. 

For a 4:2:0 formatted picture, the PIX-SUM 
equation would be: 



Max 

PIX SUM 4 Y, 
x 1 



Max 

E 

x 1.5,3.5. . . \y 1,3,5. . . 



f Max 

E 

\y i 

(Max 

E 



(Cb Xiy ) 



Max 

E 

x 1.5,3.5. . . 



'Max 

E < Cr x, y ) 

\y 1,3,5. . . 



20 Where x & y are as defined above in connection with 
the 4:2:2 formatted picture. 
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In a sequence of frames where frame 1 is 
followed by frame 2, the PIX-SUM of frame 1 can equal 
the PIX-SUM of frame 2 without the content of the 
pictures being stationary. This is depicted in Figs, 
5 7a & 7b where a first frame 370 has an object 380 
therein and a uniform or blank background. In Fig. 
7b, the second frame 3 70' has the same uniform 
background, however object 380 has moved within the 
frame. If statistically analyzed on a frame basis, 

10 the object within the frame will be identical and its 
movement will have been lost. Thus, it is preferable 
to employ multiple regions, such as rows or stripes 
372, to improve accuracy of the PIX-SUM statistic as 
a means of detecting a still frame. By dividing the 

15 frame into different regions, and requiring that the 
PIX-SUMs of each region of adjacent pictures be 
within a predefined limit, the PIX-SUM statistic can 
better ensure the absence of motion from picture 1 to 
picture 2. Note that four stripes 372 are depicted 

2 0 in Figs. 7a & 7b by way of example only. Any number 

of regions "1" could be employed between 1 (i.e., the 
entire picture) and n, where n is the number of 
pixels in each picture. In the embodiment presented 
herein, a picture is deemed still if the PIX-SUM for 
25 the four stripes in adjacent pictures are equal or 

within a predefined threshold X, and the PIX-DIFF for 
the adjacent frames is also within a predefined 
threshold Y. By way of example, an experimentally 
determined value of 256 can be employed for both X 

3 0 and Y above. 



Any number of regions 
frames. These regions can 
vertical bands, or regions 



can be employed within the 
be horizontal bands, 
of varying shape, etc. 
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The pixel values accumulated in one band are referred 
to as "PIX-SUM kl n , wherein "k" is the current picture 
and "1" is the band, e.g., 1 to 4 as depicted in 
Figs. 7a & 7b. This invention thus determines 
5 whether the current frame is stationary by looking at 
the PIX-SUM for each region and comparing the 
differences in these sums to an empirically 
determined acceptable noise level. If they are all 
within the predefined bound, then the frame is 
10 considered stationary. A picture is thus 
statistically a still picture if: 

£(PIX-SUM kl - PIX-SUM (kM)1 ) < X 

picture number, 
the region of the picture; i.e., 1-4 
for the four bands of Figs. 7a, 7b. 
an experimentally determined number 
looking at different types of still 
pictures. As one example, the number 
may comprise 256. 

20 Additionally, identifying the current frame as a 

still frame requires: 

£(|PIX-DIFF k - PIX-DIFF k+1 |) < Y 

Wherein Y is an experimentally predetermined value 
representative of how much noise can be tolerated and 
25 still identify the current frame as a still picture. 
The value relates to picture size and for a 
conventional 720 x 480 pixel picture may be 256. 



Where : k = 
1 = 

15 

X = 



EN998027 



-21- 



Once the current frame is statistically 
determined to comprise a still frame, selected 
encoding parameters are modified for use in 
compressing the still picture. For example, a zero 
5 motion vector mode is set ON restricting the motion 
vector search range to the macroblock size for the 
still picture. This means each macroblock is only 
compared to the corresponding macroblock in the 
reference still picture. The accumulated difference 

10 for the macroblock comparisons is the "prediction 

error" quantized prior to variable length coding to 
form the bit stream. The only way to maintain the 
same macroblock coding type is to match the 
macroblock perfectly with the reference picture's 

15 corresponding macroblock. A perfect match allows the 
macroblock to be coded as a "skip" macroblock. When 
two macroblocks are identical, the prediction error 
is zero. However, the prediction error may contain a 
small value in the presence of noise. The 

2 0 macroblocks are thus considered identical when the 

prediction error is within a predefined threshold 
obtained from experiments. By way of example, in the 
flowchart embodiment of Fig. 8 (described below) a 
value of 12 8 is used. Thus, if the macroblock 
25 difference (MBD) is less than this predefined value, 
the prediction error for the macroblock is made zero. 

Preferably, a fixed, minimum macroblock 
quantization level is used to maintain similar levels 
of still picture quantization. Picture quality is 

3 0 directly related to the level of quantization, which 

in turn, is related to the picture complexity and the 
amount of bits allocated to compress the picture. 
When all pictures match perfectly to a reference (I) 
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still picture, subsequent picture quality will only 
be as good as the reference picture. The quality of 
this reference picture thus controls the quality of 
the remaining pictures in a sequence of still frames. 
5 (When a picture matches perfectly to a reference 

picture, most macroblocks of that picture will become 
skip macroblocks pursuant to this invention and only 
header bits will be needed to code the still 
picture . ) 

10 For this reason, when two adjacent pictures are 

initially detected as comprising still pictures, a 
percentage of the bits normally allocated to the 
second, non-reference picture is preferably 
reallocated to the first, reference (I) picture. 

15 This allows the reference still picture quality to be 
increased. By way of example, fifty percent of the 
encode bits for a still frame following a reference 
still frame for a sequence of still frames are 
preferably reallocated to the reference picture in 

20 accordance with this invention. This is possible 
because the statistics analysis of the frames 
described herein comprises pre-encode processing, 
meaning that this information will be available prior 
to encoding of the initial reference (I) still 

25 picture. 

In addition, the average quantization level of 
the reference picture is preferably used as a minimum 
quantization level for the remaining pictures in a 
sequence of still frames. Although a perfect match 
30 is expected between two still frames in a sequence, 
random noise makes the number of skip macroblocks in 
the picture unpredictable. To use an inaccurate 
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quantization level to produce a constant bit rate 
stream is risky. An adjustable yet stable 
quantization level is desired, and it will allow 
unused bits of one macroblock to be used in another 
5 where a perfect match is not found. A stable 

quantization level can be used by both skip and non- 
skip macroblocks. A minimum quantization scheme is 
preferred to avoid overusage of bits, and encoded 
macroblocks with underproduced bits are preferably 
10 filled by padding bits as described below. 

The reference macroblock data is used to display 
all skip macroblocks. A successful still picture 
detection means the reference picture is used to 
display all still pictures. In other words, the 
15 reference picture is simply displayed again and 
again, thereby producing the desired stationary 
image . 

Although the MPEG- 2 standard is designed for 
motion video, the scheme described above allows a 
20 sequence of still frames within motion video to be 
compressed using the MPEG-2 standard while still 
minimizing or preventing perceptible pulsation 
artifacts when the still pictures are subsequently 
decompressed and displayed. 

25 Fig. 8 depicts one embodiment of picture 

preprocessing and encoding in accordance with the 
principles of the present invention. Upon detection 
that an input picture is available 400, the pre- 
encode statistics gathering process calculates the 

3 0 PIX-SUM and PIX-DIFF values for a current picture 
k+1. The PIX-SUM between the current picture PIX- 



EN998027 



-24- 



SUM k+1 and the previous picture PIX-SUM k is examined 
and the absolute value is determined 410. The PIX- 
DIFF for the current picture (PIX-DIFF k+1 ) as well as 
the PIX-DIFF k for the just prior picture are also 
5 determined and their absolute difference is obtained. 
If summations of these absolute differences are each 
less than a respective predefined value, then a still 
picture is identified 420. The empirically measured, 
predefined value for each comparison in the example 

10 of Fig* 8 is 256. Should either comparison be 

untrue, then the current picture contains motion 43 0 
and conventional video encoding of the previous 
picture (PIC k ) , for example, pursuant to MPEG 
standard, is performed. However, if both 

15 inequalities 420 are true, then the current picture 

comprises a still picture and a still picture flag is 
set for the current picture (PIC k+1 ) 440. 

Processing next determines whether the current 
picture PIC k+1 and the prior picture PIC k both 
20 comprise still pictures in accordance with each 

picture's PIX-SUM and PIX-DIFF values 450. If only 
current picture PIC k+1 is determined to be a still 
picture, then encoding parameters for the previous 
picture PIC k will remain unchanged, and conventional 

2 5 video encoding of the picture is to be performed. 

However, current picture PIC k+1 may subsequently be 
determined to comprise a beginning reference (I) 
still picture for a sequence of still frames and have 
one or more parameters modified in accordance with 

3 0 that determination. 

Once it is determined that both the current 
picture and the prior picture comprise still 
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pictures, then encoding parameters for the current 
picture PIC k4 . lf and the prior picture PIC k (i.e., 
assuming that it comprises the reference picture for 
the sequence of still frames) , will be modified in 
5 accordance with this invention. First, the motion 
vector mode for the current picture is set to zero 
460 to limit the macroblock search window and 
determination is made whether the prior picture PIC k 
comprises a reference (I) still picture 470. 
10 Assuming that the prior picture PIC k is the reference 
still picture, then the target bit rate for the prior 
reference still picture PIC k is modified. 
Specifically, "the target bit rate for PIC k " (herein 
" Pic_target k n ) is redefined as: 

15 Pic_target k = Pic_target k + 1/2 Pic_target k+1 

Moving fifty percent of the current target bit 
rate to the prior picture PIC k is presented herein by 
way of example only. Those skilled in the art will 
recognize that more or less bits may be moved to the 

20 reference still picture PIC k from the next subsequent 
still picture PIC k+1 . The concept again is to 
increase the target bits for the reference still 
picture in order to enhance encoding of the reference 
picture since this picture is subsequently used in 

25 the sequence of still pictures, thereby improving 
quality of all still pictures displayed in the 
sequence. In accordance with this invention, bits in 
still pictures subsequent to the reference still 
picture will often go unused, except for those 

3 0 necessary to code the header data. 
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The prior reference picture PIC k is then encoded 
using the modified picture target rate, and an 
average quantization level for the reference picture 
(Av QL ref ) is collected 490. 

5 Assuming that the current picture PIC k+1 is a 

still picture other than a reference still picture, 
then the zero motion vector mode is again set ON 460. 
This means that the motion vector search window will 
be exactly the size of the given macroblock, and in 

10 the same position as the corresponding macroblock in 
the prior picture. A macroblock count is then set to 
"1" 53 0 to initiate encoding of the frame. A 
determination is made whether this macroblock 
comprises an edge macroblock in the frame 540. If 

15 the macroblock is an edge macroblock, then MPEG 
standard requires conventional encoding of the 
macroblock. If the macroblock is other than an edge 
macroblock, then the comparison between corresponding 
macroblocks of adjacent still pictures is made to 

20 derive macroblock difference (MBD) data (i.e., the 

prediction error data for the macroblock) . As noted 
above, this MBD is compared against a predefined 
value, for example, 128 to determine whether this 
macroblock contains motion. In this example, a value 

25 less than 128 is indicative merely of noise and 

therefore the MBD is zeroed out (MBD = 0) 550. If 
MBD is zero, then the current macroblock is identical 
to the corresponding macroblock in the reference 
picture and the current macroblock can be encoded as 

3 0 a skip macroblock. 

When a macroblock comprises a skip macroblock, 
basically there is no data being encoded. Thus, if 



EN998027 



-27- 



adaptive quantization is employed, a very low 
quantization level (QL) results which may be 
significantly different from the quantization level 
used in the reference still picture. This difference 
5 in quantization level between corresponding 
macroblocks in a reference still picture and 
subsequent still picture may appear after compression 
and decompression as a difference, for example, in 
color shade which can result in the appearance of 
10 movement between still pictures in the sequence. 

This movement is referred to herein as "pulsation 
artifacts" . 

Thus, in accordance with the present invention, 
after determining that a macroblock is a skip 
15 macroblock, the average quantization level for the 
reference picture in the still sequence is used in 
place of the quantization level otherwise calculated 
for the skip macroblock. Thus, if QL < Av QL rcf , then 
set QL = Av QL ref , otherwise use the quantization 

2 0 level (QL) determined for the macroblock 560 to code 

that macroblock when completing encoding of the 
macroblock 565. Processing then determines whether 
this macroblock is the last macroblock of the picture 
by checking the macroblock counter 570, and if not, 
25 the counter is incremented and return is made to 
determine whether the next macroblock is an edge 
macroblock 540. 

After encoding a still picture in accordance 
with the process of Fig. 8, the number of bits used 

3 0 to encode that picture is compared to the target bits 

for the picture 500. If there are a number of skip 
macroblocks in the still picture, there will be a 
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number of unused bits left over following encoding of 
the picture. To avoid varying the quantization level 
or the amount of data in the next picture, the 
encoded picture is preferably padded out with zeros 
5 by an amount equal to the number of unused bits. 

This ensures there will be no bits left over to roll 
to the next still picture. After encoding, including 
padding out any unused bits, the current picture k+1 
and prior picture k are incremented assuming that the 

10 current picture is not the last picture in a defined 
group of pictures (GOP) 510. Thereafter, return is 
made to encode the next picture 520. Based upon the 
above, those skilled in the art will note that 
preprocessing and encoding in accordance with this 

15 invention results in enhanced display of a reference 
still picture and its following still pictures to 
produce the desired stationary picture using moving 
picture compression/decompression . 

In an alternate aspect, the invention presents a 
20 process for encoding a partially still picture. For 
example, a partial still picture may be a picture 
with a stationary background and motion in the 
foreground. In this case, motion estimation is 
biased to the current macroblock position. The 
25 macroblocks corresponding to the stationary 

background would produce the smallest prediction 
error. However, this prediction error may not be 
zero due to the presence of noise. Thus, a threshold 
value is obtained from experiments, and is used to 
3 0 filter out any unwanted noise when the prediction 
error is obtained from a zero motion vector. The 
prediction error is made zero if it is within this 
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threshold. In one implementation, a value of 64 can 
be used. 

To restate, in accordance with this aspect of 
the invention if there is a partially still picture, 
5 then for every macroblock of the picture, the motion 
vectors are evaluated to determine whether there is a 
difference between the current picture and the 
previous picture. If the difference data is small, 
then the difference is written off as noise and the 

10 macroblock difference (MBD) or prediction error is 

made zero. In this case, the macroblock is assigned 
the reference quantization level of the last 
reference, partially still picture, thereby making 
stationary that portion of the current picture since 

15 the average quantization level of the reference 

picture is used as a minimum quantization level for 
corresponding macroblocks. This scheme thus produces 
stable backgrounds in partially still pictures by 
maintaining the macroblock coding type and the 

20 macroblock quantization level. 

Fig. 9 depicts one embodiment for processing a 
non-still picture with potentially one or more still 
macroblocks. Encoding begins 600 by setting a 
macroblock counter to "1" 610 and determining whether 

2 5 the current macroblock is an edge macroblock of the 

picture 62 0. When the macroblock is an edge 
macroblock, MPEG standard does not allow the 
macroblock to be defined as a skip macroblock and 
conventional macroblock encoding is performed 630. 

3 0 However, if the current macroblock is other than an 

edge macroblock, motion estimation is performed on 
the macroblock to determine a predictive error and 
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thereby determine whether the macroblock comprises an 
intra macroblock or a non- intra macroblock. Assuming 
that the macroblock is a non- intra macroblock 640, 
the picture type determines the specific coding 
5 change. If the picture is an I picture, conventional 
encoding of the macroblock is performed 630. 
However, if the picture type is either P or B, then 
the macroblock coding type and the macroblock 
quantization level can potentially be modified in 
10 accordance with this aspect of the present invention. 

For a P type picture, processing determines 
whether the motion vector in the motion estimation 
result is zero, and whether the prediction error or 
macroblock difference (MBD) is less than a predefined 

15 threshold. This predefined threshold can again be 

experimentally determined, and in the example of Fig. 
9, comprises 64. If both of these conditions are 
met, then the macroblock difference is set to zero 
making the macroblock a skip macroblock and the 

20 quantization level for the macroblock is set to the 
average quantization level for the reference picture 
in the GOP. Thereafter, processing determines 
whether the current macroblock comprises the last 
macroblock of the picture. If not, the macroblock 

25 counter is incremented 710 to initiate processing of 
the next macroblock, otherwise return is made to 
process a next picture 720. 

Assuming that the current picture type 650 is B, 
then processing next inquires whether the motion 
30 vector in the motion estimation result is equal to 
the motion vector of the previous macroblock of the 
same picture and whether the macroblock difference is 
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less than a predefined threshold 680. In this 
example, the predefined threshold is again assigned a 
value of 64, which can be varied as desired based 
upon experimentation. If both conditions are met, 
5 then the macroblock is again defined as a skip 
macroblock and the quantization level for the 
macroblock is set to the average quantization level 
for the reference frame 670. If the requirements of 
either inquiry 660 or 680 are unmet, then the 

10 macroblock difference value remains the determined 

macroblock difference value and a quantization level 
is calculated as conventionally done for MPEG 
encoding 690. The resulting MBD and quantization 
level values are employed in encoding the current 

15 macroblock. Subsequently, processing inquires 

whether the current macroblock is the last macroblock 
of the picture 70 0, and proceeds therefrom as 
described above. 

From the above discussion, those skilled in the 
20 art will note that encoding in accordance with the 

principles of the present invention minimizes or even 
eliminates perceptible picture pulsation in a 
sequence of MPEG encoded still frames or partially 
still frames. The encoding technique of this 
25 invention can ensure constant picture quality for a 
series of still pictures and/or partially still 
pictures. Further, bits used in encoding a still 
picture or partially still picture are conserved by 
encouraging the use of skip macroblocks during the 
30 encode process. Adaptive encoding in accordance with 
this invention is performed by a pre-encode 
statistics gathering process and includes 
programmable still picture detection features. 
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The present invention can be included, for 
example, in an article of manufacture (e.g., one or 
more computer program products) having, for instance, 
computer usable media. This media has embodied 
5 therein, for instance, computer readable program code 
means for providing and facilitating the capabilities 
of the present invention. The articles manufactured 
can be included as part of the computer system or 
sold separately. 

10 The flow diagrams depicted herein are provided 

by way of example. There may be variations to these 
diagrams or the steps or operations described herein 
without departing from the spirit of the invention. 
For instance, in certain cases the steps may be 

15 performed in differing order, or steps may be added, 
deleted or modified. All these variations are 
considered to comprise part of the present invention 
as recited in the appended claims. 

While the invention has been described in detail 
20 herein in accordance with certain preferred 

embodiments thereof, many modifications and changes 
therein may be affected by those skilled in the art. 
Accordingly, it is intended by the appended claims to 
cover all such modifications and changes as fall 
25 within the true spirit and scope of the invention. 
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Claims 



1 1 . A method for encoding a sequence of video 

2 frames comprising for each frame of the sequence of 

3 video frames: 

4 (a) encoding said frame employing at least 

5 one controllable parameter; and 

6 (b) adapting said encoding (a) of said 

7 frame when said frame is a still frame, said 

8 adapting including adjusting said at least one 

9 controllable parameter employed in encoding said 

10 still frame to minimize pulsation artifacts 

11 between still frames of a sequence of still 

12 frames within said sequence of video frames, 

13 wherein said still frame comprises one still 

14 frame of said sequence of still frames. 

1 2. The method of claim 1, further comprising 

2 determining whether said frame comprises said still 

3 frame . 
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1 3. The method of claim 2, wherein each frame 

2 of the sequence of video frames comprises a plurality 

3 of pixels, and wherein each pixel of each frame 

4 comprises a mult i -bit value and said determining 

5 comprises : 

6 determining for a current frame (k+1) of 

7 the sequence of frames a summation statistic 

8 (PIX-SUM k+1 ) derived from said multi-bit values 

9 of the plurality of pixels of the current frame; 

10 determining a summation statistic (PIX- 

11 SUM k ) derived from the multi-bit values of the 

12 plurality of pixels of a prior frame (k) 

13 preceding the current frame (k+1) in the 

14 sequence of video frames; and 

15 determining whether: 

16 |PIX-SUM k - PIX-SUM k+1 | < X 

17 where X is a predefined value representative of 

18 a still frame. 
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1 4. The method of claim 3, wherein said 

2 determining further comprises: 

3 determining an accumulated absolute 

4 difference derived from adjacent pixels of said 

5 plurality of pixels of the current frame (PIX- 

6 DIFF k+1 ) ; 

7 determining an accumulated absolute 

8 difference derived from adjacent pixels of said 

9 prior frame (PIX-DIFF k ) ; and 

10 determining whether: 

11 |PIX-DIFF k - PIX-DIFF k ^| < Y 

12 wherein Y is a predefined value, and 

13 wherein said current frame is determined to 

14 comprise said still frame if both |PIX-SUM k - 

15 PIX-SUM k+1 | < X and | PIX-DIFF k - PIX-DIFF k+1 | < Y 

16 are true. 

1 5. The method of claim 4, wherein said 

2 determining further comprises dividing the current 

3 frame and the prior frame into z corresponding 

4 regions, and wherein said determining comprises 

5 determining whether for each of said z corresponding 

6 regions : 

7 |PIX-SUM kz - PIX-SUM (k+1)z | < X. 
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1 6. The method of claim 5, wherein X=2 56, 

2 Y=256, and z;>4, and wherein said four corresponding 

3 regions of said current frame and said prior frame 

4 comprise four horizontal bands. 

1 7. The method of claim 2, further comprising 

2 determining whether said frame comprises a reference 

3 (I) still frame for said sequence of still frames and 

4 if so increasing a target bitrate to be used by said 

5 encoding (a) to encode said reference still frame. 

1 8. The method of claim 7, wherein said 

2 increasing of the target bitrate for said reference 

3 still frame comprises detecting a subsequent B or P 

4 still frame in said sequence of still frames and 

5 moving target bits from said subsequent B or P still 

6 frame to said reference still frame. 

1 9. The method of claim 8, wherein said moving 

2 of target bits comprises moving fifty percent of 

3 target bits for encoding said subsequent B or P still 

4 frame to encoding of said reference still frame. 

1 10. The method of claim 2, wherein each frame 

2 comprises a plurality of macroblocks, and said method 

3 further comprises setting a zero motion vector mode 

4 ON upon determining that said frame comprises a still 

5 frame, and determining a predictive error for each 

6 macroblock of said still frame, and when said 

7 predictive error is less than a predetermined value, 

8 setting said predictive error to zero making said 

9 macroblock a skip macroblock. 
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1 11. The method of claim 10, further comprising 

2 calculating a quantization level (QL) for said skip 

3 macroblock for use in encoding said macroblock, and 

4 comparing the calculated quantization level (QL) to 

5 an average quantization level of a reference still 

6 frame (Av QL ref ) of said sequence of still frames and 

7 replacing said calculated quantization level with 

8 said average quantization level of said reference 

9 still frame when said calculated quantization level 

10 is less than said average quantization level of said 

11 reference still frame. 

1 12. The method of claim 1, wherein when said 

2 frame comprises a still frame, said adapting (b) 

3 comprises defining a plurality of macroblocks in said 

4 still frame as skip macroblocks, and maintaining a 

5 minimum quantization level for encoding (a) of each 

6 skip macroblock of said still frame. 

1 13. The method of claim 12, wherein said 

2 adapting (b) comprises maintaining said minimum 

3 quantization level for each skip macroblock of said 

4 still frame to be an average quantization level of a 

5 reference still frame (Av QL rcf ) for said sequence of 

6 still frames. 
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1 14. The method of claim 2, wherein said frame 

2 comprises a plurality of macroblocks, and wherein 

3 said determining comprises determining whether said 

4 frame comprises a motion frame, and when so, said 

5 method further comprises for each of at least some 

6 macroblocks of said motion frame: 

7 (i) determining whether said macroblock 

8 comprises a still macroblock; 

9 (ii) encoding said macroblock employing at 

10 least one controllable parameter; and 

11 (iii) adapting said encoding of said 

12 macroblock when said determining (i) determines 

13 said macroblock to be said still macroblock, 

14 said adapting including adjusting said at least 

15 one controllable parameter employed in encoding 

16 said still macroblock to minimize pulsation 

17 artifacts between corresponding still 

18 macroblocks of adjacent frames in said sequence 

19 of video frames. 
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1 15. The method of claim 14, wherein said 

2 adapting of said encoding comprises confirming that 

3 said still macroblock is other than an edge 

4 macroblock and is a non- intra macroblock, and that 

5 said frame comprises a P frame, and when confirmed, 

6 said method further comprises: determining that a 

7 motion vector for said still macroblock is zero and a 

8 macroblock difference (MBD) value is less than a 

9 predefined value, and when true, encoding said still 

10 macroblock as a skip macroblock and assigning a 

11 minimum quantization level to said skip macroblock. 

1 16. The method of claim 15, wherein said 

2 assigning of the minimum quantization level comprises 

3 assigning an average quantization level for a 

4 reference frame for said P frame as quantization 

5 level for said skip macroblock. 

1 17. The method of claim 14, wherein said 

2 adapting of said encoding comprising confirming that 

3 said still macroblock is other than an edge 

4 macroblock and is a non- intra macroblock, and that 

5 said frame comprises a B frame, and when confirmed, 

6 said method further comprises: determining that a 

7 motion vector for said still macroblock is equal to a 

8 motion vector of a previous macroblock in the B frame 

9 and that a macroblock difference (MBD) value is less 

10 than a predefined value, and if so, then encoding 

11 said still macroblock as a skip macroblock and 

12 assigning a minimum quantization level to said skip 

13 macroblock. 
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1 18. The method of claim 17, wherein said 

2 assigning of the minimum quantization level comprises 

3 assigning an average quantization level for a 

4 reference frame for said B frame as quantization 

5 level for said skip macroblock. 
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1 19. A method for encoding a frame of a sequence 

2 of video frames, said frame having a plurality of 

3 macroblocks, said method comprising for each of at 

4 least some macroblocks of said plurality of 

5 macroblocks : 

6 (a) encoding said macroblock employing at 

7 least one controllable parameter; and 

8 (b) adapting said encoding of said 

9 macroblock when said macroblock is a still 

10 macroblock, said adapting including adjusting 

11 said at least one controllable parameter 

12 employed in encoding said still macroblock to 

13 minimize pulsation artifacts between 

14 corresponding still macroblocks of adjacent 

15 frames in said sequence of video frames. 

1 20. The method of claim 19, further comprising 

2 determining whether said macroblock comprises said 

3 still macroblock . 

1 21. The method of claim 19, wherein said 

2 adapting of said encoding comprises encoding said 

3 still macroblock as a skip macroblock and assigning a 

4 minimum quantization level to said skip macroblock. 

1 22. The method of claim 20, wherein said 

2 assigning of the minimum quantization level comprises 

3 assigning an average quantization level of a 

4 reference frame to said frame as said minimum 

5 quantization level for said still macroblock. 
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1 23. A system for encoding a sequence of video 

2 frames comprising: 

3 a pre-encode processing unit, said pre- 

4 encode processing unit comprising: 

5 a statistics measurement unit for use 

6 in determining whether a current frame of 

7 the sequence of frames comprises a still 

8 frame; 

9 a control unit for modifying at least 

10 one controllable parameter employed in 

11 encoding said still frame to minimize 

12 pulsation artifacts between still frames of 

13 a sequence of still frames when said 

14 statistics measurement unit determines said 

15 current frame to comprise said still frame; 

16 and 

17 an encoding engine for encoding said 

18 current frame of the sequence of video frames 

19 using the at least one controllable encode 
2 0 parameter set by said pre-encode processing 
21 unit. 

1 24. The system of claim 23, wherein said 

2 statistics measurement unit comprises means for 

3 determining whether said current frame comprises a 

4 reference still frame, and if so, said control unit 

5 comprises means for increasing a target bitrate used 

6 by the encoding engine to encode said reference still 

7 frame. 
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1 25. The system of claim 24, wherein said means 

2 for increasing said target bitrate comprises means 

3 for transferring target bits from a subsequent still 

4 frame in a sequence of still frames to said reference 

5 still frame, said reference still frame also being in 

6 said sequence of still frames. 

1 26. The system of claim 23, wherein each frame 

2 of said sequence of video frames comprises a 

3 plurality of macroblocks, and wherein said control 

4 unit of said pre-encode processing unit further 

5 comprises means for setting zero motion vector mode 

6 ON upon determining that said frame comprises a still 

7 frame, and means for determining a predictive error 

8 for each macroblock of said still frame, and for each 

9 macroblock of said still frame for setting its 

10 predictive error to zero making said macroblock a 

11 skip macroblock when its determined predictive error 

12 is less than a predetermined value representative of 

13 a still macroblock. 

1 27. The system of claim 26, wherein said 

2 control unit further comprises means for maintaining 

3 a minimum quantization level for encoding each skip 

4 macroblock of said still frame. 

1 28. The system of claim 27, wherein said means 

2 for maintaining comprises means for maintaining said 

3 minimum quantization level for each skip macroblock 

4 of said still frame at an average quantization level 

5 of a reference still frame (Av QL ref ) for said 

6 sequence of still frames. 
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1 29. The system of claim 23, wherein each frame 

2 of the sequence of video frames comprises a plurality 

3 of pixels, and wherein each pixel of each frame 

4 comprises a multi-bit value, and said statistics 

5 measurement unit comprises means for: 

6 determining for a current frame (k+1) of 

7 the sequence of frames a summation statistic 

8 (PIX-SUM k+1 ) derived from said multi-bit values 

9 of the plurality of pixels of the current frame; 

10 determining a summation statistic (PIX- 

11 SUM k ) derived from the multi-bit values of the 

12 plurality of pixels of a prior frame (k) 

13 preceding the current frame (k+1) in the 

14 sequence of video frames; and 

15 determining whether: 

16 |PIX-SUM k - PIX-SUM k+1 | < X 

17 where X is a predefined value representative of 

18 a still frame. 
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1 30. The system of claim 29, wherein said 

2 control unit further includes means for: 

3 determining an accumulated absolute 

4 difference derived from adjacent pixels of said 

5 plurality of pixels of the current frame (PIX- 

6 DIFF k+1 ) ; 

7 determining an accumulated absolute 

8 difference derived from adjacent pixels of said 

9 prior frame (PIX-DIFF k ) ; and 

10 determining whether: 

11 |PIX-DIFF k - PIX-DIFF k + 1 | < Y 

12 wherein Y is a predefined value, and 

13 wherein said current frame is determined to 

14 comprise said still frame if both |PIX-SUM k - 

15 PIX-SUM k+1 | < X and | PIX-DIFF k - PIX-DIFF k+1 | < Y 

16 are true. 

1 31. The system of claim 23, wherein when said 

2 current frame comprises other than said still frame, 

3 said control unit further comprises means for 

4 adapting encoding each of at least one macroblock of 

5 said current frame when said macroblock comprises a 

6 still macroblock, said means for adapting including 

7 means for adjusting said at least one controllable 

8 parameter employed in encoding said still macroblock 

9 to minimize pulsation artifacts between corresponding 

10 still macroblocks of adjacent frames in said sequence 

11 of video frames. 
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1 32. The system of claim 31, wherein said means 

2 for adapting comprises means for confirming that said 

3 still macroblock is other than an edge macroblock and 

4 is a non- intra macroblock, and that said frame 

5 comprises a P frame, and wherein said control unit 

6 further includes means for determining that a motion 

7 vector for said still macroblock is zero and a 

8 macroblock difference (MBD) value is less than a 

9 predefined value, and when true, said encoding engine 

10 further comprises means for encoding said still 

11 macroblock as a skip macroblock having a minimum 

12 quantization level defined by a quantization level of 

13 a reference frame for said P frame. 

1 33. The system of claim 31, wherein said means 

2 for adapting comprises means for confirming that said 

3 still macroblock is other than an edge macroblock and 

4 is a non- intra macroblock, and that said frame 

5 comprises a B frame, and wherein said control unit 

6 further comprises means for determining that a motion 

7 vector for said still macroblock is equal to a motion 

8 vector of a previous macroblock in the B frame and 

9 that a macroblock difference (MBD) value is less than 

10 a predefined value, and when true, said encoding 

11 engine further comprises means for encoding said 

12 still macroblock as a skip macroblock having a 

13 minimum quantization level to said skip macroblock, 

14 defined by a quantization level of a reference frame 

15 for said B frame. 
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1 34. A system for encoding a macroblock of a 

2 plurality of macroblocks of a frame in a sequence of 

3 video frames, said system comprising: 

4 an encoding engine for encoding said 

5 macroblock of said frame using at least one 

6 controllable encode parameter; and 

7 means for adapting said encoding of said 

8 macroblock when said macroblock is a still 

9 macroblock, said adapting including means for 

10 adjusting said at least one controllable 

11 parameter employed in encoding said still 

12 macroblock to minimize pulsation artifacts 

13 between corresponding still macroblocks of 

14 adjacent frames in said sequence of video 

15 frames. 

1 35. The system of claim 34, further comprising 

2 means for determining whether said macroblock 

3 comprises said still macroblock. 

1 36. The system of claim 35, wherein said means 

2 for adapting of said encoding comprises means for 

3 adjusting encoding of said still macroblock to encode 

4 said still macroblock as a skip macroblock and to 

5 encode said skip macroblock with a minimum 

6 quantization level defined by a quantization level of 

7 a reference frame to said frame having said still 

8 macroblock. 
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1 37. A computer program product comprising a 

2 computer usable medium having computer readable 

3 program code means therein for use in encoding a 

4 sequence of video frames, said computer readable 

5 program code means in said computer program product 

6 comprising for each frame of the sequence of video 

7 frames: 

8 computer readable program code means for 

9 causing a computer to affect determining whether 

10 said frame comprises a still frame; 

11 computer readable program code means for 

12 causing a computer to affect encoding said frame 

13 employing at least one controllable encode 

14 parameter; and 

15 computer readable program code means for 

16 causing a computer to affect adapting said 

17 encoding of said frame when said determining 

18 determines said frame to be said still frame, 

19 said adapting including adjusting said at least 
2 0 one controllable parameter employed in encoding 

21 said still frame to minimize pulsation artifacts 

22 between still frames of a sequence of still 

23 frames within said sequence of video frames, 

24 wherein said still frame comprises one still 

25 frame of said sequence of still frames. 
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1 38. The computer readable program code means of 

2 claim 37, wherein said computer readable program code 

3 means for causing a computer to affect determining 

4 comprises computer readable program code means for 

5 causing a computer to affect determining whether said 

6 frame comprises a reference (I) still frame for said 

7 sequence of still frames, and when so, said computer 

8 readable program code means for causing a computer to 

9 affect adapting comprises computer readable program 

10 code means for causing a computer to affect 

11 increasing a target bitrate to be used by said 

12 encoding to encode said reference still frame by 

13 moving target bits from at least one subsequent still 

14 frame of said sequence of still frames to said 

15 reference still frame. 

1 39. The computer readable program code means of 

2 claim 38, wherein each frame of said sequence of 

3 video frames comprises a plurality of macroblocks, 

4 and wherein said computer readable program code means 

5 for causing a computer to affect adapting comprises 

6 computer readable program code means for causing a 

7 computer to affect setting a zero motion vector mode 

8 ON upon determining that said frame comprises said 

9 still frame, and for determining a predictive error 

10 for each macroblock of said still frame, and when 

11 said predictive error is less than a predetermined 

12 value, for setting said predictive error to zero 

13 making said macroblock a skip macroblock. 
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1 40. The computer readable program code means of 

2 claim 39, wherein said computer readable program code 

3 means for causing a computer to affect adapting 

4 comprises computer readable program code means for 

5 causing a computer to affect maintaining a minimum 

6 quantization level for encoding each skip macroblock 

7 of said still frame, wherein said minimum 

8 quantization level comprises an average quantization 

9 level of a reference still frame for said sequence of 
10 still frames. 
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1 41. The computer readable program code means of 

2 claim 37, wherein said frame comprises a plurality of 

3 macroblocks, and wherein said computer readable 

4 program code means for causing a computer to affect 

5 determining comprises computer readable program code 

6 means for causing a computer to affect determining 

7 whether said frame comprises a motion frame, and when 

8 so, said computer readable program code means further 

9 comprises for each of at least some macroblocks of 

10 said motion frame: 

11 computer readable program code means for 

12 causing a computer to affect determining whether 

13 said macroblock comprises a still macroblock; 

14 computer readable program code means for 

15 causing a computer to affect encoding said 

16 macroblock employing at least one controllable 

17 parameter; and 

18 computer readable program code means for 

19 causing a computer to affect adapting said 

20 encoding of said macroblock when said macroblock 

21 comprises said still macroblock, said adapting 

22 including adjusting said at least one 

23 controllable parameter employed in encoding 

24 said still macroblock to minimize pulsation 

25 artifacts between corresponding still 

2 6 macroblocks of adjacent frames in said sequence 

27 of video frames. 



★ * * * * 
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ADAPTIVE ENCODING OF A SEQUENCE OF STILL FRAMES OR 
PARTIALLY STILL FRAMES WITHIN MOTION VIDEO 

Abstract of the Disclosure 

Method, system and computer program product are 
5 provided for adaptively encoding in hardware, 

software or a combination thereof a series of still 
or partially still pictures using motion video 
encoding. A pre -encode statistics measurement unit 
is employed to derive statistics on each frame of the 

10 sequence of video frames to be encoded. The 

statistics are determinative of whether the frame 
comprises a still frame. If so, at least one 
controllable parameter to be used to encode the still 
frame is modified, and an encoding engine employs the 

15 at least one controllable parameter to encode the 

still frame so that pulsation artifacts are prevented 
between still frames of the series of still frames. 
Partial still picture encoding to prevent pulsation 
artifacts on a macroblock level is also addressed. 

20 
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